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ABSTRACT 

(U)  A  general  survey  is  made  on  the  data  collected  by  the  Village 
Information  System,  Thailand  project.  Principal  component  analysis  and 
principal  factor  analysis  data  reduction  techniques  are  applied  to  the 
data  for  selected  areas  in  northeast  Thailand  and  the  results  are  com- 
pared. Algebraic  models  are  applied  to  a  selected  variable  of  the  data 
and  forecasting  techniques  applied  to  each  model  to  predict  the  value 
of  the  variable  in  the  next  time  period.  Conclusions  are  presented  con- 
cerning the  operational  usefulness  of  the  analytical  techniques  applied 
to  the  data. 
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I.      INTRODUCTION    (U) 


A.  PURPOSE  (U) 

(U)  This  thesis  continued  the  study  of  insurgency  incident  reports 
from  Thailand  which  was  begun  in  a  thesis  by  Major  P.  E.  Gardner,  USMC 
[ 1 ] .  Gardner  described  and  documented  the  development  of  the  insurgency 
in  Thailand,  within  the  security  classification  level  of  this  document, 
and  the  establishment  of  the  Village  Information  System,  Thailand 
project  to  collect  and  process  data  concerning  the  insurgency.  That 
information  is  available  and  will  not  be  repeated  here. 

B.  SCOPE  OF  ANALYSIS  (U) 

(U)  An  expanded  data  base,  containing  approximately  four  times  the 
initial  number  of  incident  reports,  was  available  for  this  thesis.  This 
data  base  was  surveyed  with  respect  to  the  information  it  purported  to 
contain,  information  it  was  believed  it  should  contain,  the  manner  of 
presentation  of  the  collected  information,  and  the  observed  information 
in  the  incident  report  files  themselves.  These  areas  were  evaluated, 
critiqued,  and  recommendations  made  for  changes  or  additions  which  the 
author  believed  might  improve  the  usefulness  of  the  data  base.  It  was 
fully  realized  that  such  a  survey  and  recommendations  could  only  be  made 
in  a  very  general  sense  since  the  author  was  several  thousand  miles 
away,  had  no  personal  experience  concerning  the  insurgency  or  counter- 
insurgency  activities  in  Thailand,  and  had  only  the  data  tape  and  a  few 
after-action  and  trip  reports  relating  directly  to  the  data  base  for  the 
thesis . 
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(U)  The  factor  analysis  technique  applied  by  Gardner  was  applied 
to  the  expanded  data  for  the  same  areas,  and  the  resulting  factor 
patterns  compared  with  those  obtained  by  Gardner.  Two  different  tech- 
niques of  factor  analysis  were  then  applied  to  a  new  major  geographical 
area  and  its  major  subdivisions.  The  two  factor  analysis  techniques 
were  compared  for  the  different  size  data  sets  and  also  using  sets  of 
generated  data  with  known  underlying  probability  distributions  for  the 
variables  under  consideration. 

(U)  Prediction  of  levels  of  insurgent  activity  based  on  the  infor- 
mation contained  in  the  data  was  considered.  Criteria  for  predictions 
and  forecasting  techniques  in  a  counterinsurgency  environment  were 
developed,  and  three  algebraic  forecasting  models  were  presented,  applied 
to  six  geographical  areas  representative  of  different  patterns  of  insur- 
gency, and  evaluated  for  applicability  and  usefulness  as  an  aid  in 
conducting  a  counterinsurgency  program. 

(U)  It  will  be  noted  that  there  are  some  variations  in  the  number  of 
incident  report  files  from  a  given  area  in  different  phases  of  the 
analysis  effort.  This  was  a  result  of  the  elimination,  either  manually 
or  by  the  computer,  of  some  incident  report  files  counted  in  the  total 
of  6264  reports  constituting  the  data  base.  The  eliminated  files  were 
those  in  which  information  was  missing  in  the  variables  necessary  for  a 
particular  analysis. 
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C.   CLASSIFICATION  (U) 

(U)  The  overall  security  classification  of  this  thesis  is  CONFI- 
DENTIAL, Not  Releasable  to  Foreign  Nationals.  The  information  on  the 
data  tapes  is  UNCLASSIFIED  as  recorded,  but  the  description  of  an  inci- 
dent report  in  terms  of  the  variable  code  definitions  is  CONFIDENTIAL. 
As  the  material  is  concerned  with  the  collection  and  evaluation  of 
intelligence  information,  it  was  placed  in  Group-3  in  accordance  with 
Department  of  Defense  Directive  5200.10. 

II.  VILLAGE  INFORMATION  SYSTEM,  THAILAND   (U) 

(U)  The  background  and  development  of  the  Village  Information 
System,  Thailand  (VIST)  project  has  been  described  and  documented  by 
Gardner  [1],  and  no  further  information  in  these  areas  was  received. 
A  second  magnetic  tape  of  VIST  incident  data  (VIST  Task  2)  was  made 
available  by  the  Stanford  Research  Institute  subsequent  to  the  comple- 
tion of  the  analysis  by  Gardner,  and  it  was  this  second,  expanded  base, 
tape  which  provided  the  data  set  for  the  analysis  which  was  conducted 
in  the  preparation  of  this  thesis. 
A.   EXPANDED  VIST  DATA  SET  (U) 

(U)  The  data  set  used  contains  6264  incident  files,  compared  to 
1592  in  the  initial  set,  for  a  33  month  period  from  January  1967  through 
September  1969.  The  80  column  format  for  an  incident  file  is  shown  in 
Appendix  A.  In  addition  to  providing  coverage  for  an  enlarged  time 
span,  the  expanded  data  set  also  increased  the  area  coverage  from  six 
Changwats  to  seven  and  added  many  Amphoes  to  the  initial  six  Changwats. 
The  expanded  lift  of  geographical  'areas  and  their  codes  are-  shayn  in 
Appendix  B. 
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B.   GENERAL  ANALYSIS  OF  VIST  DATA  (U) 

(U)  Before  any  attempt  was  made  to  apply  statistical  or  mathematical 
techniques  of  analysis  to  the  data  set,  a  general  critical  survey  of  the 
data  was  conducted  so  that  a  basic  evaluation  of  the  data  set  could  be 
made.  This  survey  considered  both  the  overall  aspects  of  the  incident 
data  reporting  and  recording  system  within  the  VIST  project  with  respect 
to  the  80  column  coded  representation  of  a  report,  and  the  specific 
elements  of  reported  information  contained  within  the  data  set. 

1.  The  Incident  Reporting  System 

(C)  The  failure  of  the  VIST  project  to  proceed  into  Task  3, 
the  reporting  and  recording  of  Royal  Thai  Government  (RTG)  forces  data, 
seemed  to  be  the  greatest  weakness  of  the  VIST  project  from  the  point  of 
view  of  usefulness  of  the  coded  data.  Regardless  of  the  amount  of 
analysis  performed  or  the  analysis  techniques  applied,  the  results  would 
still  have  to  be  interpreted  as  only  a  one-sided  picture  of  Communist 
Terrorist  activity.  In  the  absence  of  information  concerning  RTG  counter- 
insurgency  forces  and  operations,  both  military  and  civic  action;  there 
was  no  way  to  make  interpretations  of  significant  underlying  cause-effect 
relationships  which  might  be  indicated  by  the  results  of  data  analysis. 
An  increase  in  insurgent  activity  followed  by  a  sharp  decrease  in  a  given 
area  could  indicate  aggressive  and  successful  RTG  counterinsurgency 
operations  or  a  virtual  ceding  of  the  area  to  Ccmmunist  Terrorist  control. 
In  the  same  manner,  periodic  cycles  of  varying  amplitude  could  be  indica- 
tive of  the  effectiveness  of  RTG  counterinsurgency  programs  in  retaining 
control  of  an  area  against  a  persistent  Camtunist  Terrorist  offensive, 
with  the  cycles  being  caused  by  the  time  lag  factor  in  resupplying 
personnel  and  equipment  to  the  insurgent  organization  from  outside 
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Thailand  following  an  unanticipated  insurgent  program  setback;  or  it 

could  be  the  effects  of  local  weather  on  insurgent  mobility.  In  this 

regard  it  was  noted  that  the  local  weather  was  not  an  item  of  information 

reported. 

(C)  The  second  major  element  of  the  reporting  system  considered  was 

that  the  data  information  contained  in  an  incident  file  represented  a 

report  received  under  an  incident  code  and  not  a  single,  exclusive 

representation  of  an  event.  The  incident  codes  are  listed  and  defined 

in  Appendix  C.  After  several  series  of  successive  reports  were  reviewed 

from  different  geographic  areas  the  following  hypothetical  event  was 

constructed: 

A  eommunist  Terrorist  courier  was  sighted  by  a  police  patrol  boat 
while  crossing  the  Mekong  from  Laos.  The  courier  was  apprehended 
the  next  day  approximately  15  kilometers  from  the  border  crossing 
point  while  trying  to  avoid  a  checkpoint  set  up  by  the  Regional 
Force  in  response  to  the  police  alert.  The  courier  was  carrying 
mail  addressed  by  Vietnamese  name  only  and  an  operational 
directive  for  a  planned  increase  in  armed  propaganda  and  terrorism. 
After  being  transferred  to  an  RTA  Interrogation  Center,  the  courier 
revealed  the  location  of  two  base  camps  he  was  to  service  in  the 
mountains  and  the  general  location  of  his  point  of  departure,  a 
headquarters  and  training  center,  located  in  Laos. 

The  inflation  factor  was  apparent,  but  it  was  also  apparent  that  the 
majority  of  the  incident  codes  were  not  readily  subject  to  multiple 
incident  reports.  The  inflation  factor  was  considered  significant  how- 
ever, based  on  the  relatively  small  numbers  of  incidents  occurring  per 
month  in  most  amphoe  size  areas,  particularly  with  respect  to  total 
incident  level  forecasting.  One  method  of  reducing  multiple  reporting 
of  one  event  or  instance  of  Corrnunist  Terrorist  activity  would  be  the 
assignment  of  an  identification  "tag"  to  an  event,  such  as  a  firefight, 
where  this  tag  would  be  an  element  of  all  reports  resulting  from  the 
examination  of  documents  or  the  interrogation  of  prisoners  obtained  as 
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a  result  of  the  encounter.  Another  approach  would  be  the  review  of  the 
incident  reporting  codes  to  eliminate  those  items  which  are  essentially 
of  intelligence  interest  only  and  do  not  constitute  events  in  space  or 
time.  Rarely,  if  ever,  is  the  capture  of  a  document  an  isolated  event 
or  of  significance  in  itself;  the  significance/  if  any,  stems  from  the 
information  contained  within  the  document  which  can  be  exploited  by  the 
RTG  in  its  counter insurgency  program. 

(C)  Within  the  format  of  the  data  elements  coded  (Appendix  A)  ,  the 
categories  of:  incident  reporter/  incident  initiator,  and  incident  tar- 
get require  a  total  of  16  columns  of  0  or  1  entries.  Since  each  of  these 
categories  is  partitioned,  numerical  coding  of  the  variables  could 
represent  the  same  information  in  three  columns. 

2.  The  Reported  Information  (U) 

(U)  It  was  discovered  that  a  useful  aid  in  an  overall  analysis 
of  the  information  content  of  the  VIST  data  was  a  by-product  of  computer 
analysis  routines  performed  on  the  data,  namely,  a  listing  of  the  means 
and  standard  deviations  of  the  variables  under  consideration.  The 
listing  for  the  total  data  set,  6264  incident  report  files,  is  shown  in 
Table  I. 
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TABLE  I 

OBSERVED  DATA  PARAMETERS 

GABLE 

MEAN 

ST.DEV. 

1 

26.131195 

18.011444 

2 

0.171372 

0.376819 

3 

0.274179 

0.446067 

4 

0.070362 

0.255772 

5 

0.108319 

0.310793 

6 

0.147906 

0.354994 

7 

0.217509 

0.412522 

8 

0.048523 

0.214827 

9 

0.013565 

0.115676 

10 

0.028411 

0.166154 

11 

0.172968 

0.378213 

12 

0.004628 

0.067870 

13 

0.078781 

0.269410 

14 

0.076390 

0.265678 

15 

0.038309 

0.191950 

16 

0.052996 

0.223996 

17 

0.001117 

0.033409 

18 

14.444157 

9.418326 

19 

5.786589 

3.274970 

20 

0.468303 

0.498937 

21 

0.042939 

0.202728 

22 

0.109432 

0.312135 

23 

0.024739 

0.155335 

24 

0.492889 

0.499886 

25 

15.070463 

41.106903 

26 

0.678805 

9.149450 

27 

8.531009 

8.106230 

(U)  The  analysis  effort  was  placed  primarily  of  the  means  of  the 
variables  since  no  specific  interpretation  or  significance  could  be 
assigned  to  the  variance  of  a  dichotomous,  zero-one,  variable. 
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(C)  Variable  number  one  was  the  incident  code  reported.  When  the 
incident  codes  (Appendix  C)  were  studied,  it  was  concluded  that  there 
was  a  sufficient  ordering  underlying  the  values  of  this  variable,  from 
more  violent  to  less  violent  with  increasing  magnitude  of  the  variable, 
for  comparisons  to  be  made.  This  ordering  was  very  fundamental  and 
very  dependent  on  the  interpretations  of  the  definitions  of  the  inci- 
dent codes  since  the  definitions  are  not  unambiguous.  The  numerical 
gaps  in  the  code  and  the  predominance  of  nonviolent  codes,  which 
seemed  to  have  a  lower  degree  of  ordering  than  the  violent  codes, 
forced  the  conclusion  that  no  useful  meaning  could  be  assigned  to  the 
absolute  numerical  value  of  the  mean  for  any  geographical  area  or  time 
span.  However  between  different  areas  or  time  spans,  the  difference 
in  the  means  would  provide,  in  a  very  general  sense,  a  relative  measure 
of  the  degree  of  violence  associated  with  the  insurgency  activity. 
The  significance  of  such  a  measure  was  considered  to  be  derived  from 
degree  of  violence  as  an  indicator  of  phase  progression  in  the 
generally  accepted  formula  or  model  of  a  Communist-type  insurgency  or 
war  of  liberation. 

(C)  Variables  two  through  seven  constituted  the  reporter  of  the 
incident.  Since  the  six  variables  constitute  a  partition  of  the  set  of 
incident  reporters,  it  was  expected  that  the  mean  value  of  a  variable 
would  indicate  the  propertion  of  the  total  incidents  reported  by  that 
variable.  The  sum  of  these  means  was  equal  to  .989647  which  indicated 
that  the  identity  of  the  originator  of  the  incident  report  was  recorded 
on  the  report  and  transferred  to  the  data  set  for  about  99  per  cent  of 
the  reports  in  the  data  base.  Variables  eight  through  twelve  consti- 
tute a  partition  of  the  set  of  incident  initiators.  The  sum  of  these 
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means  was  .268095  with  the  mean  of  the  unknown  initiator  variable 
(Variable  12)  equal  to  .004628.  If  it  had  been  the  case  that  the  sum 
of  the  means  was  nearly  equal  to  one,  the  means  would  have  been  useful 
as  indicators  of  the  proportions  of  activity  initiated  by  government 
forces  and  insurgent  forces.  No  explanation  was  found  for  the  high 
proportion  of  incident  files  with  no  information  recorded  in  this 
category,  and,  with  only  about  27  per  cent  information,  the  application 
of  proportions  to  the  total  recorded  was  not  considered  to  be  a  valid 
technique  for  approximating  the  actual  proportions  which  would  have 
been  available  without  missing  data.  The  same  situation  was  observed 
in  the  consideration  of  variables  thirteen  through  seventeen,  the 
target  of  the  incident  category,  where  the  sum  of  the  means  was  equal 
to  .247593,  with  the  mean  of  variable  seventeen  (questionable  target) 
equal  to  .001117.  It  was  noted  however,  that  all  the  variables  in  this 
category  tended  to  reflect  incidents  which  would  be  initiated  by 
insurgents  and  not  government  forces.  The  largest  proportion  of  the 
recorded  information  concerning  the  initiator  of  incidents  corresponded 
to  Communist  Terrorists,  but  the  next  largest  proportion  corresponded 
to  the  Royal  Thai  Army  initiated  incidents.  It  was  conjectured  that 
the  fact  this  category  had  the  lowest  sum  of  means  might  stem  from  a 
feeling  among  report  originators  that  the  category  was  not  exhaustive 
and  no  entry  at  all  might  be  better  than  an  erroneous  entry. 

(C)  Considering  the  completely  ordered  variables;  day  of  month 
(Variable  18) ,  month  (Variable  19) ,  and  hour  of  day  (Variable  27) : 
the  means  and  standard  deviations  were  both  evaluated.  In  the  case  of 
the  day  of  month  and  month  variables,  the  observed  means  and  standard 
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Standard 

Mean 

Deviation 

15.5 

8.09 

6.5 

3.18 

deviations  correspond  closely  to  the  expected  means  and  standard 

deviations  of  random  variables  with  a  uniform  distribution  over  the 

appropriate  intervals,  as  shown  by: 

Observed  Expected  (U(l,  30)  or  U(l,  12)) 

Standard 
Mean     Deviation 

Day     14.444157    9.418326 
Month    5.786589    3.274970 
The  mean  of  the  hour  of  day  is  less  than  the  expected  mean  by  a  factor 
of  2/3,  and  the  standard  deviation  less  by  a  factor  of  1/2.  The  indi- 
cation was  that  the  preponderance  of  incidents  took  place  in  the 
morning  hours  rather  than  being  uniformly  distributed  throughout  the 
day  or  the  average  daylight  hours. 

(C)  With  respect  to  the  people  involved  (Variables  21  through  26) , 
it  was  noted  that  while  the  average  incident  involved  15  Commnist- 
Terrorists,  the  standard  deviation  of  this  variable  was  almost  three 
times  as  large  as  the  mean.  This  would  tend  to  reduce  the  value  of  the 
mean  of  this  variable  as  a  comparative  indicator.  It  was  also  observed 
that  approximately  one  half  of  all  incident  reports  included  a  positive 
response  to  Variable  24  (other  ethnic  groups) .  Dependent  on  the 
judgement  of  operational  usefulness  of  ethnic  information,  it  could  be 
worthwhile  to  expand  the  ethnic  variable  categories,  or  to  generate  a 
numerical  code  for  the  groups  to  be  included  in  this  variable. 
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III.   FACTOR  ANALYSIS  (U) 

(U)  An  objective  in  the  analysis  of  a  large  set  of  data  which  has 
the  potential  of  producing  significant  or  useful  results,  is  a 
reduction  in  the  size  of  the  data  set.  If  this  can  be  done  without 
destroying  the  useful  value  of  the  information  contained  in  the  data, 
the  result  is  a  smaller  set  of  data  to  manipulate,  thus  simplifying 
additional  analysis  efforts.  It  is  the  objective  of  factor  analysis 
to  represent  a  set  of  variables  in  terms  of  a  smaller  set  of  factors 
which  can  be  constructed  from  the  data. 

(U)  The  simplest  model  for  representing  one  variable  in  terms  of 
other  variables  is  the  linear  model  which  is  used  in  most  factor 
analyses.  Within  the  linear  framework  there  are  two  distinct 
approaches:  to  extract  the  maximum  variance  from  the  data,  or  to  maxi- 
mally reproduce  the  correlations  among  the  variables.  Harmon  [2]  and 
Morrison  [3]  were  considered  to  be  the  better  references  for  the 
material  which  follows.  For  consistency,  the  notation  provided  by 
Harmon  was  used  wherever  possible. 
A.   PRINCIPAL  COMPONENTS  ANALYSIS  (U) 

(U)  The  method  of  principal  components  empirically  reduces  a  large 
set  of  data  so  that  n  original  variables  are  describable  in  terms  of 
F  uncorrelated  components.  If  x.  is  an  original  variable,  the  model  is 

(3-1)    x.  =  a.^  +  a.2F2  +  ...  +  a.Fn  (j  =  1,  2,  . . . ,  n) . 

All  n  components  are  required  to  reproduce  the  correlations  among  the 

n  original  variables,  but  each  component,  in  turn,  makes  a  maximum 

contribution  to  the  sum  of  the  variances  of  the  original  variables  . 

That  is,  F,  is  tha-t  linear  ;carbir£itioji  of  the  n  original  variables 
1*  ■  >  i  >  i   in 
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which  contributes  a  maximum  to  the  sum  of  the  variances  of  the  original 
variables,  F„  contributes  a  maximum  to  the  remaining  variance,  and  so 
on;  with  the  sum  of  the  variances  of  the  n  principal  components  being 
equal  to  the  sum  of  the  variances  of  the  n  original  variables.  Conse- 
quently, depending  upon  the  amount  of  total  variance  which  it  is  desired 
to  retain  in  the  reduced  data  set,  only  a  small  number  of  the  components 
will  be  required  to  represent  the  original  data  in  a  practical  application 
of  the  method. 

1.  Geometric  Interpretation  (U) 

(U)  The  principal  components  solution  can  be  described  geometri- 
cally by  considering  a  scatter  of  p  observed  values  of  the  n  variables 
in  n-space.  Harmon  [2]  proves  that  for  the  principal  component  solution, 
where  the  rank  of  the  correlation  matrix  is  n,  that  the  common- factor 
space  for  these  observations  is  n-dimensional .  The  swarms  of  points  will 
be  generally  concentric,  similiar,  and  similarly  situated  ellipsoids; 
exactly  so  for  a  normally  distributed  population  [7].  If  the  major  axis 
of  the  ellipsoid  is  defined  as  the  axis  passing  through  the  direction  of 
maximum  variance  of  the  points,  Morrison  [3]  shows  this  major  axis 
corresponds  to  the  first  principal  component  when  the  axis  is  rotated  to 
maximize  variance.  The  maximizing  solution  would  be  the  characteristic 
vector  of  the  greatest  root  (eigenvalue)  of  the  sample  correlation 
matrix.  The  remaining  roots  and  characteristic  vectors  of  the  correlation 
matrix  would  then  determine  the  orientation  of  the  higher  ordered  princi- 
pal component  axes.  The  geometric  interpretation  also  implicitly 
contains  the  minimizing  of  the  sums  of  squares  distances  from  each  point 
to  its  projection  on  each  of  the  successive  n  coordinate  axes.  Mini- 
mization of  the  sums  of  squares  distances  is  equivalent  to  the 
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maximization  of  variances,  and  an  orthogonal  least-squares  solution  to 
the  best  fitting  line  to  the  swarm  of  points  would  yield  the  first 
principal  component  axis  directly.  Successive  orthogonal  least-squares 
solutions  would  yield  the  remaining  n-1  axes. 

2.  Limitations  of  Principal  Components  (U) 

(U)  Principal  components  was  described  above  as  being  highly 
dependent  of  the  sums  of  the  variances  of  the  original  variables.  If 
these  variables  are  not  measured  in  the  same  units,  changing  the  scale 
or  performing  other  linear  transformations  of  the  variables  could  so 
change  the  shapes  of  the  ellipsoids  until  the  axes  (principal  compo- 
nents) would  have  no  consistent  meaning  and  the  results  of  the  analysis 
would  not  be  useful  in  any  practical  sense. 
B.   CLASSICAL  FACTOR  ANALYSIS  (U) 

(U)  Classical  factor  analysis  (which  will  be  referred  to  as  factor 

analysis  in  the  following  discussion  to  distinguish  the  classical  factor 

analysis  model  from  the  principal  components  analysis  model)  generally 

uses  the  second  approach  to  factor  analysis  in  that  the  model  is 

basically  designed  to  maximally  reproduce  the  correlations  among  the 

original  variables.  The  model  describes  each  of  the  original  n 

variables  (x . ,  i  =  1 ,  2 ,  . . . ,  n)  in  terms  of  m  common  factors  and  a 

unique  factor  by 

(3-2)    x.  =  a.,F,  +  a.~F0  +  ...  +  a.  F  +  d.U.  . 
w  '     l    ll  1    i2  2         rm  m    i  i 

The  number  of  common  factors,  m,  is  usually  much  smaller  than  n,  and 

these  m  factors  account  for  the  correlations  among  the  original 

variables.  The  unique  factor  accounts  for  error  and  the  remaining 

variance  of  the  variable.  In  this  model  the  F's  are  considered  random 

variables  and  the .correlations  among  the  variables  of  a  given  sample 
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are  treated  as  if  they  were  the  population  correlations;  that  is, 
statistical  variations  are  ignored  and  no  assumptions  are  made,  in  the 
general  case,  about  the  statistical  distributions  of  the  original 
variables.  Since  the  means  and  variances  of  the  F's  and  U*s  are  unknown 
in  practice,  they  can  be  assumed  to  be  zero  and  unity  respectively  with 
no  loss  of  generality  in  the  model;  additionally,  the  n  unique  factors 
are  supposed  to  be  mutually  independent  and  independent  of  the  m 
common  factors.  The  basic  problem  of  factor  analysis  is  to  estimate  the 
nm  loadings  of  the  common  factors  [ 2 ,  Chap.  2 ] . 

1.  Principal  Factor  Solution  (U) 

(U)  Harmon  states  that  the  principal  factor  solution  is 
probably  the  most  widely  used  technique  in  factor  analysis.  The  tech- 
nique is  adapted  from  the  principal  components  solution  to  the  factor 
analysis  model  (3-2) ,  and  is  distinct  from  principal  components  in  the 
amount  of  variance  analysed.  The  communalities  of  the  variables  are 
placed  on  the  diagonal  of  the  correlation  matrix  to  obtain  a  reduced 
matrix  for  principal  factor  analysis.  Where,  in  the  principal  compo- 
nents model,  the  principal  components  can  be  expressed  in  terms  of  the 
observed  variables,  approximating  procedures  must  be  used  in  determiriing 
the  principal  factors. 

(U)  The  sum  of  squares  of  factor  coefficients  gives  the  communality 
of  a  specified  variable  where  one  coefficient  is  representative  of  the 
contribution  of  that  factor  to  the  communality  of  the  variable.  The 
principal  factor  method  selects  successive  ordered-factor  coefficients 
so  as  to  make  the  sum  of  the  contributions  of  the  ordered  factors  to  the 
total  communality  a  maximum.  For  the  first  factor  this  sum  is  given  by 
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with  each  a.,  chosen  to  maximize  V  under  the  constraint 
ll 

m 
(3-4)    r.,  =  E  a.  a, 

uc  p=1  rpkp       (i^k  =  X/  2,  ...,  n), 

where  r.  .  is  the  communality  of  variable  i  and  r.,  =  r,  . .  The  con- 
11  J  jk        kj 

straint  conditions  are  the  replacement  of  the  observed  correlations  by 
the  reproduced  correlations  which  implies  the  assumption  of  zero 
residuals . 

The  solution  technique  for  this  constrained  maximization  problem  pro- 
posed by  Harmon  [2]  is  the  application  of  Lagrange  multipliers,  u. .  , 

which  defines  a  new  function  T,  such  that 

n  n    m 

(3-5)    2T  =  Vn  -  E    u..r..  =  Vn  -  E     E   u..a.  a.   . 
1  j/fe=1  DkDk    1  j/k=lp=1   Dkjp^ 

The  partial  derivatives  of  the  new  function  T  with  respect  to  any  one 
of  the  n  variables  a.,  are  set  equal  to  zero,  giving, 

3T  n 

while  the  partial  derivatives  with  respect  to  any  of  the  other 
coefficients,  a.   (p^  1)  are  set  equal  to  zero,  giving 

Jr 


<3"7>    3a. 


9T       n 


l±    "jk^p  ~  0  '  P  *  i 


JP    k= 
These  sets  of  equations  are  combined  as  follows: 

(3-8)    H-  ■  6lpajl  "Jx  "jk^p  =  0  ,  P  =  1,  2 m, 

where  the  Kronecker  Delta,  5,  =  1  if  p  =  1,  and  S,  =  0  if  p  ^  1. 

If  the  combined  equations  are  multiplied  by  a.,  and  the  summing 

operation  completed  with  respect  to  j  this  gives 

n  n      n 

(3-9)         5,       E  a?,     -     E       E     u.,a.,a,      =  0. 


L       E  atn     -     E       E     u.,  a.,a, 
iP^l^l         j=lk=1     DkDl°kp 
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■^  Ujkajl  =  a^  .  and  if  ^  afr  =  ^  , 


then  equation  (3-9)  can  be  written  as 


n 


(3-10) 


%**■     "^^l^p 


=  0. 


If  this  equation  is  multiplied  by  a.  and  the  summing  operation 

completed  with  respect  to  p  ,  the  resulting  equation  is 

n        m 
(3-11)    a_.n  A.  -  z  a, _,    Z  a.  a.    =0. 


jl 


k=l  """   ptl  *&**> 


When  the  constraining  conditions  in  equation  (3-4)  are  applied  to  this 
equation,  the  result  is 


n 

S 

k=l 


(3"12)     E  rjk\l  --\ajl  =0 


which  is  a  set  of  n  equations ,  one  equation  for  each  value  of  j ,  the 
first  of  which  can  be  written,  in  terms  of  communality  (r, ,  =  hp  as 

(3-13)    (h^  -  A)al:L  +  r12a21  +  ...  +  r^a  ,  =  0. 

Hadley  [6]  has  shown  that  a  necessary  and  sufficient  condition  for  this 

type  set  of  equations  to  have  a  nontrivial  solution  is  the  vanishing  of 

the  determinant  of  the  coefficients  of  the  unknowns ,  a . , .  The  expansion 

of  the  determinant  is  an  n   order  polynomial  in  A.  This  polynomial, 

the  characteristic  equation  of  the  set  of  equations,  has  a  family  of 

solutions,  all  of  which  are  proportional  to  one  solution,  where  the 

factor  of  proportionality  is 

n 


(3-14) 


A,  =  I  a?. 
1   j=l  31 


However,  this  is  the  equation  for  V,  ,  (equation  (3-3))  ,  the  quantity  to 
be  maximized;  that  is,  the  maximizing  solution  to  V.  is  the  largest  root, 
or  eigenvalue,  of  the  characteristic  equation.  To  find  the  coefficients 
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of  the  first  factor,  Fy   which  will  account  for  the  maxiinum  amount  of 
conmunality  possible,  the  value  of  ?l  is  substituted  into  the  set  of 
equations  of  type  (3-13)  and  a  solution  vector,  eigenvector,  (a)  is 
obtained.  To  satisfy  the  conditions  of  equation  (3-3) , 

ajl  /Al 


(3-15)    aj:L  = 


r 


a?n  +  a2.,  +  ...  +  a2 


(j  =  1,  2,  ... ,  n) 


lll  T  a21 


hi 


gives  the  coefficients  for  the  first  factor,  F  . 

The  resulting  problem  is  to  then  find  the  coefficients  for  the  factor, 
F2,  which  will  account  for  a  maximum  amount  of  the  residual  communality. 
The  residual  correlations  after  the  first  factor  has  been  determined 
are  given  by 


jk  "  rjk  "  "jAl  -  ^2^2  +  aj3\3  +  '••  +  ^m3]^ 


(3-16) 
and  the  quantity  to  be  maximized  is 


(3-17)    V2  =  aJ2  +  a|2+...+a22, 

subject  to  the  constraint  conditions  in  equation  (3-16) .  Iteration  of 
the  Lagrange  multiplier  technique  yields  Xy,   the  second  largest  eigen- 
value of  the  solution  to  the  equations  of  type  (3-13)  as  the  maximizing 
value  of  V„.  The  factor  coefficients  are  then  determined  in  the  same 
manner,  using  equation  (3-15) ,  as  was  used  for  the  first  factor.  The 
procedure  is  applied  simultaneously  in  practice  to  obtain  all  factor 
coefficients,  each  corresponding  to  successively  decreasing  values  of 
the  eigenvalues. 
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C.  APPLICATION  TO  VIST  DATA  (U) 

(U)  The  27  variables  [1]  were  retained  frcm  the  analysis  of  the 
initial,  1592  reports,  data  set,  and  are  defined  in  Appendix  D.  Bio- 
medical Computer  Program  (BIMED)  03M  [A]  was  used  for  principal  compo- 
nents analysis  and  BIMED  X72  [5]  was  used  for  principal  factor  analysis. 
The  computational  procedures  for  these  programs  are  described  in 
Appendix  E  and  Appendix  F  respectively. 

(C)  The  two  initial  runs  were  made  for  comparison  of  the  factor 
patterns  from  the  expanded  data  set  with  those  obtained  by  Gardner  using 
the  same  factor  analysis  technique.  The  first  run  was  for  the  total 
data  set.  In  geographical  area  coverage  as  well  as  time  period  coverage, 
the  initial  data  set  was  a  subset  of  the  expanded  data  base.  The  results 
of  this  comparison  indicated  a  general  agreement  among  the  factor 
loadings  with  recording  of  the  factors  accomplished.  The  order  of  the 
11  factors  from  the  initial  data  corresponded  to  the  following  order  of 
10  factors  from  the  expanded  data: 

1592  -123456789  10  11 
6264  -213459687  No 
The  tenth  factor  from  the  expanded  data  loaded  very  high  (VH)  on  variable 
19,  month  of  incident;  there  was  no  corresponding  loading  with  the  initial 
data.  The  initial  data  factor  10  loaded  on  Vietnamese/Pathet  Lao  which 
loaded  high  negative  (H-)  in  the  expanded  data  factor  5.  The  very  high 
(VH)  loading  for  Non-Communists  for  factor  11  from  the  initial  data  did 
not  appear  in  the  factor  pattern  from  the  expanded  data. 

(C)  These  results  agreed  with  the  anticipated  result  of  only  general 
correspondence  between  the  first  few  factors  which  was  expected  because 
of  the  divergent  area  and  time  coverage. 
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(C)  The  second  run  compared  415  incident  reports  in  Amphoe  Chiang 
Khong,  Changwat  Chiang  Rai,  with  312  incident  reports  for  the  same  area 
in  the  initial  data.  The  resulting  factors  ordered  as  follows: 

312  -123456  789  10 

415  -123654  No  7  9  No 
The  expanded  data  analysis  generated  factors  which  could  be  interpreted 
as  Police/Non-Communist  Terrorists  and  Agent  Reported  in  the  seventh  and 
tenth  positions.  These  factors  were  not  found  in  the  initial  factors. 
The  Date/Time  and  Paramilitary  factors  in  the  initial  analysis  were  not 
reproduced  from  the  expanded  data  set.  However,  the  factor  patterns 
obtained  from  the  same  area  with  an  increase  in  time  period  only  were  in 
closer  agreement  than  the  total  data  set  factor  patterns.  The  factor 
patterns  obtained  from  the  initial  data  are  reproduced  in  Appendix  G. 
1.  Analysis  of  a  Changwat  (U) 

(C)  Changwat  Nakhon  Phanom  was  selected  for  more  detailed 
applications  of  the  factor  analysis  techniques  to  the  data  base.  Nakhon 
Phanom  lies  in  northeast  Thailand  on  the  border  with  Laos.  Orientation 
maps  of  the  area  are  shown  in  Appendix  H.  The  first  instances  of  an 
active  insurgency  in  Thailand  were  observed  in  Amphoe  Na  Kae  and  Nakhon 
Phanom  has  had  the  highest  proportion  of  incident  reports  (2526/6264)  of 
any  changwat  in  Thailand.  Nakhon  Phanom  consists  of  ten  Amphoes  which 
are  very  divergent  in  their  size,  ethnic  populations,  population  densities, 
terrain,  transportation  and  communication  networks,  and  level  of 
insurgency  characteristics  [ 8 ] .  Incident  reports  ranged  from  16  in  the 
flat  Amphoe  Si  Songkhram  in  the  North  to  1266  in  the  mountainous  Amphoe 
Na  Kae  in  the  South. 
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(C)  Principal  factor  analyses  and  principal  components  analyses 
were  conducted  on  representative  sizes  of  data  sets  from  this  area. 
The  resulting  factor  matrices  and  the  factor  matrices  for  the  total 
data  set  are  contained  in  (XMPUTER  OUTPUT,  PART  1.  The  areas  are 
identified  in  Table  II. 

TABLE  II  (CONFIDENTIAL) 
GEOGRAPHICAL  AREAS  FOR  FACTOR  ANALYSIS  (U) 


Area  No. 

Code 

1 

2006 

2 

2001 

3 

2008 

4 

2004 

5 

2007 

6 

2005 

7 

20 

8 

— 

Geographical  Name 


Number  of  Incidents 


Amphoe  Ban  Phaeng  46 

Amphoe  Muang  93 

King  Amphoe  Don  Tan  140 

Amphoe  That  Phanom  196 

Amphoe  Mukdahan  358 

Amphoe  Na  Kae  1266 

Changwat  Nakhon  Phanom  2526 

North  and  Northeast  Thailand  6264 


The  first  six  areas  were  disjoint  and  independent,  together  they  formed 
a  subset  of  the  seventh  area,  and  the  seventh  a  subset  of  the  eighth. 
Study  of  the  factor  matrices  indicated  that  as  the  number  of  incidents 
considered  increased,  the  principal  factor  and  principal  component 
factor  matrices  tended  to  become  more  similar,  that  is,  to  generate  the 
same  factor  loadings.  This  indication  was  examined  in  more  detail  by 
measuring  the  differences  between  corresponding  elements  of  the  factor 
matrices  for  each  sample  size,  accounting  for  the  bipolar  nature  of  the 
factor  loading  vectors  where  necessary.  The  measures  of  difference 
considered  were:  the  average  difference  in  the  corresponding  elements, 
and  the  average  sum  of  squared  differences  in  the  corresponding  elements. 
The  result  of  this  analysis  is  shown  in  Table  III. 
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46 

0.0153366 

93 

0.0442839 

140 

0.0474198 

196 

0.0511202 

358 

0.0511445 

1266 

0.0380625 

2526 

0.0206374 

6264 

0.0028734 

TABLE  III  (UNCLASSIFIED) 

CORRESPONDING  ELEMENT  COMPARISONS  OF  FACTOR  MATRICES  (U) 

Sample  Size    Average  Difference    Sum  of  Squared  Differences 

0.0000020 
0.0007640 
0.0008715 
0.0000864 
0.0026657 
0.0001932 
0.0000000 
0.0000000 

(U)  Based  on  the  results  shown  in  Table  III,  it  was  concluded  that, 

given  the  data  base  considered,  the  two  procedures  tend  to  generate  the 

same  factors  with  a  sufficiently  large  sample  size. 

2.  Analysis  of  Generated  Data  (U) 

(U)  In  order  to  examine  the  conclusion  that  principal  components 

analysis  and  principal  factor  analysis  tend  to  generate  the  same  factors 

using  the  same  data  with  a  sufficiently  large  sample  size  in  a  more 

general  sense  than  using  the  VIST  data  set,  two  4000-occurrence  data 

sets  were  generated.  For  each  occurrence,  three  independent  standard 

normal  observations  were  taken,  y^y.^,  and  an  arbitrary  (4x3) 

coefficient  matrix,  A,  was  applied  to  the  Y  vector  to  generate  four 

normal  random  variables,  x, ,x~,x~,x.,   which  were  not  independent.  In 

matrix  terms: 

(3-18)    X  =  AYT 

Using  the  same  A  matrix  and  Y  vector ,  four  dichotcmous ,  dependent ,  and 

discrete  random  variables,  x',x',x',x*,  were  generated  by: 
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(3-19)  x-   =  1,  y1ail+y2a12+y3ai3>ail  yyl+a.2 y^+a.3 yy3 

=  0,  otherwise 

i     =  1,  2,   3,  4 

The  following  notation  was  applied. 

Let:   (f  . .)  be  a  principal  factor  (n  by  m)  factor  matrix  from 

normally  distributed  data, 

(f.  . .)  be  a  principal  factor  factor  matrix  from  discrete  data, 

(p  .  .)  be  a  principal  component  factor  matrix  from  normally 

distributed  data,  and 

(p,  . .)  be  a  principal  component  factor  matrix  from  discrete  data. 

The  measures  of  difference  between  factor  matrices  were  defined  by 

n   m 


(3-20) 


Mn=  Z   Z   (f   .  .  -  P  .  . ) 2 

mn 


for  normally  distributed  data,  and 

n   m 

(3-21)    m   =  E   E   (f,  .  .  -  p,  .  .)2 
i=1  j=1   di]   Fd  ij' 

mn 
for  discrete  data.  The  results  shown  in  Table  IV  were  obtained. 


.1  .!■*     C  C  I 


[I       .-Jf        I  II    I      f 


27 


UNCLASSIFIED 


I 

1 


UNCLASSIFIED 


i   -ii-> 


, 


TABLE  IV  (UNCLASSIFIED) 

COMPARISON  OF  PRINCIPAL  FACTOR  AND  PRINCIPAL  COMPONENT 

FACTOR  ANALYSIS  USING  NORMALLY  DISTRIBUTED  AND  DISCRETE  DATA  (U) 

*-4 


Sample  Size 

Mn  x  10  4 

Md  x  10' 

30 

2.567 

13.595 

60 

35.281 

17.764 

100 

20.828 

15.467 

200 

18.294 

6.454 

600 

21.886 

4.067 

1000 

23.732 

4.346 

2000 

21.462 

3.332 

4000 

0.000 

0.000 

Based  on  the  results,  and  qualified  by  the  limitation  of  the  amount  of 
testing  conducted,  it  was  concluded  that  the  analysis  with  generated 
data  tended  to  support  the  results  of  the  analysis  of  the  VIST  data, 
and  without  regard  for  the  underlying  distributions  of  the  data. 

(U)  Further  analysis  was  undertaken  to  examine  the  effects  of 
increasing  sample  size,  considering  only  the  case  of  normally  distri- 
buted data.  It  was  observed  that  the  computed  correlation  matrices, 
eigenvalues,  and  the  cumulative  proportion  of  total  variance  accounted 
for  by  each  factor  were  identical  (within  the  computational  limitations 
of  the  computer)  regardless  of  which  factor  analysis  technique  was 
applied,  for  a  given  sample  size.  This  tended  to  confirm  the  conclusion, 
based  on  the  theoretical  presentation  of  the  techniques,  that  the 
differences  in  the  solutions  were  a  result  of  the  communalities  computed 
by  the  principal  factor  method.  It  was  also  observed  that  the  indi- 
vidual elements  of  the  correlation  matrices,  the  eigenvalues,  and  the 
cumulative  propertions  of  total  variances,  each  tended  to  converge 
toward  a  specific,  but  unknown,  value  with  increasing  sample  size;  as 
would  be  expected  from  the  Law  of  Large  Numbers.  It  was  then  noted  that, 
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in  the  principal  factor  solution,  the  change  in  the  value  of  a  given 
factor  loading  for  a  variable  was  in  the  same  direction,  increasing  or 
decreasing,  as  the  change  in  the  coirmunality  for  that  variable;  and 
the  magnitude  of  the  change  was  generally  proportional  to  the  magnitude 
of  the  change  in  the  corcnunality.  This  led  to  an  examination  of  the 
comparative  stability  of  the  factor  loadings  with  respect  to  sample 
size  under  the  principal  factor  and  principal  components  solutions. 
The  changes  in  communality  are  shown  in  Table  V,  changes  in  factor 
loadings  for  the  principal  factor  solution  factor  one  in  Table  VI,  and 
the  changes  in  the  factor  loadings  for  the  principal  components  solution 
factor  one  in  Table  VII. 

TABLE  V  (UNCLASSIFIED) 
CQMMUNALrTIES  VS  INCREASING  SAMPLE  SIZE  (U) 


Sample  Size 

Variable  1 

Variable  2 

Variable  3 

Variable  4 

30 

.789517 

.929347 

.962307 

.972257 

60 

.744993 

.876937 

.908039 

.917428 

100 

.762194 

.897185 

.929005 

.938611 

200 

.762222 

.897218 

.929039 

.938645 

600 

.758955 

.893372 

.925057 

.934622 

1000 

.758638 

.892999 

.924670 

.934231 

2000 

.760840 

.895591 

.927355 

.936943 

4000 

.624458 

.932500 

.975567 

.985781 

TABLE  VI  (UNCLASSIFIED) 
CHANGES  IN  FACTOR  LOADINGS  VS  SAMPLE  SIZE,  PRINCIPAL  FACTOR  (U) 
Sample  Size    Variable  1    Variable  2    Variable  3    Variable  4 

-.02578 

.01075 

.00002 
-.00204 
-.00019 

.00237 

.01930 


29 


30-60 

-.02542 

60-100 

.00991 

100-200 

.00001 

200-600 

-.00187 

600-1000 

-.00018 

000-2000 

.00126 

000-4000 

-.08203 

02806 

-.02821 

01094 

.01100 

00002 

.00002 

00207 

-.00108 

00020 

-.00020 

00139 

.00140 

02472 

.02491 
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TABLE  VII  (UNCLASSIFIED) 
CHANGES  IN  FACTOR  LOADINGS  VS  SAMPLE  SIZE,  PRINCIPAL  COMPONENTS  (U) 


Sample  Size 

Variable  1 

Variable  2 

Variable  3 

Variable  4 

30-60 

-.10249 

-.02048 

-.00219 

.00176 

60-100 

.03436 

.01528 

-.00009 

-.00242 

100-200 

.00328 

-.00515 

.00047 

.00201 

200-600 

-.00918 

.00081 

-.00037 

-.00072 

600-1000 

.00039 

-.00022 

.00017 

.00040 

1000-2000 

.00496 

.00067 

.00000 

.00018 

2000-4000 

-.00146 

.00004 

-.00006 

-.00015 

30 
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The  indication  was  that  the  factor  loadings  from  the  principal  component 
solution  tended  to  converge  to  some  specific  value  in  the  same  manner 
as  the  correlation  matrix.  Combining  this  indication  with  the  apparent 
relationship  between  factor  loadings  and  communalities  from  the  princi- 
pal factor  solution,  and  the  apparent  tendency  toward  similar  solutions 
from  the  two  techniques  with  increasing  sample  size,  the  indication 
that  the  factor  loadings  from  the  principal  factor  solution  tended  to 
converge  to  some  specific  value  as  a  function  of  the  convergence  of 
the  correlation  matrix  and  the  communality,  where  the  conmunality  of 
a  variable  replaced  its  autocorrelation  prior  to  the  final  iteration 
of  the  principal  factor  solution  technique. 

(U)  There  remained  the  question  of  why  the  two  techniques,  each 
with  the  same  correlation  matrix,  produced  different  factor  matrices 
at  small  sample  sizes;  and  why  the  two  techniques  produced  essentially 
the  same  factor  matrices  at  large  sample  sizes,  again  each  starting 
with  the  same  correlation  matrix.  That  is,  what  is  the  relationship 
between  sample  size  and  communality;  the  theoretical  indications  are 
that  they  are  unrelated,  but  the  empirical  evidence  supports  conver- 
gence of  the  solutions  at  large  sample  sizes.  A  satisfactory  answer 
to  this  question  was  not  found. 
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IV.   FORECASTING  INSURGENT  LEVELS  OF  ACTIVITY  (U) 
(U)  The  capability  of  completely  and  accurately  anticipating  or 
predicting  an  opponents  course  of  action  would  be  of  inestimable  value 
in  any  conflict  situation.  While  complete  and  accurate  prediction  is 
an  ultimate  goal,  it  has  rarely,  if  ever,  been  achieved  in  a  military 
situation.  In  general,  the  military  commander  must  use  estimates  of 
opposition  capabilities  which  are  based  on  a  subjective  evaluation  of 
the  degree  of  truth  and  accuracy  of  all  the  information  which  has  been 
gathered  concerning  the  opposition.  The  commander  can  then  apply  some 
type  of  subjective  probability  to  the  alternative  courses  of  action 
within  opposition  capability  in  order  to  arrive  at  a  decision  for  his 
own  best  course  of  action.  The  amount  of  probability  the  commander 
applies  could  be  influenced  by:  personal  experience  and  military 
judgement,  the  amount  of  available  intelligence  information,  historical 
information,  the  results  of  war  games  or  staff  studies,  or  any  of  many 
other  factors. 

(U)  Within  the  limited  military  intelligence  experience  of  the 
author,  mathematical  forecasting  techniques  have  not  been  used  in  an 
insurgency  environment  as  one  of  the  factors  to  influence  the  planning 
of  counterinsurgency  operations.  The  VIST  data  provided  a  base  for 
examining  some  forecasting  techniques  and  considering  criteria  for 
their  potential  usefulness  in  counterinsurgency  planning. 
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A.   FORECASTING  TECHNIQUES  (U) 

(U)  To  be  applicable  for  tactical  use  in  a  counterinsurgency 
environment,  a  forecasting  technique  should:  be  relatively  simple  in 
a  mathematical  sense,  not  requiring  the  use  of  sophisticated  computer 
equipment  or  extensive  hand  computations;  not  require  the  maintenance 
of  a  large  file  of  historical  data;  have  flexibility  in  adjusting  the 
rate  of  response;  and,  of  course,  be  relatively  accurate  depending  on 
the  command  level  and  the  specific  use  intended  for  the  forecasts. 

(U)  The  above  criteria  led  to  the  selection  of  three  algebraic 
models  and  their  associated  smoothing  techniques  for  forecasting  total 
insurgent  level  of  activity  in  a  specified  geographical  area.  Brown 
[9]  provided  the  majority  of  the  information  for  the  following 
discussion. 

1.  Constant  Model  (U) 

(U)  This  is  the  appropriate  model  where  there  is  some  reason 
to  accept  the  hypothesis  that  the  insurgency  is  relatively  stable 
within  the  area  and  time  span  being  considered.  The  number  of  incidents 
occuring  within  one  time  unit  x  ,  could  then  be  represented  by 
(4-1)    xt  =  a  +  et 

where  the  average  value  of  the  error  term,  e,  ,  is  zero  in  all  the 
techniques  being  considered.  The  basic  problem  in  the  model  is  to 
estimate  the  value  of  a. 

a.  Moving  Averages  (U) 

(U)  The  moving  averages  method  provides  a  simple  straight- 
forward technique  for  estimating  a  which  does  not  require  extensive 
historical  data.  Considering  the  N  most  recent  observations, 
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(4-2)    M.  =  x,  +  x.  ,  +  . . .  +  x.  . 
t    t    t-1         t-J 


•N 


N 
gives  the  average  of  these  N  observations  and  the  current  estimate 
for  a,  the  forecast  for  the  number  of  incidents  in  the  next  time  period. 
When  the  next  observation,  x , ,  is  received,  the  forecast  error  is 
given  by 

(4-3)    E  =  Mt  -  xt+1 

and  the  new  average  is 

(4-4)    Mt+1  =  Mt  +  xt+1  -  xfc_N 

N 

which  remains  the  average  of  the  N  most  recent  observations. 

(U)  Moving  averages  provides  an  accurate  estimate  in  that  the 
average  minimizes  the  sum  of  squares  of  the  differences  between  the  N 
most  recent  observations  and  the  estimate  of  the  next  observation.  The 
major  disadvantage  to  moving  averages  is  the  flexibility  in  adjusting 
the  rate  of  response.  The  rate  of  response  is  controlled  by  N,  the 
larger  of  the  value  of  N,  the  more  stable  the  estimates  and  the  more 
data  which  must  be  maintained  on  file.  To  have  the  capability  of 
increasing  the  stability  of  the  model,  desirable  if  the  level  of  insur- 
gent activity  was,  in  fact,  constant,  would  require  the  additional 
maintenance  of  historical  data  no  longer  being  used  in  the  forecasting 
procedure.  This  disadvantage  can  be  removed  if  x  „  is  estimated 
instead  of  being  retained  in  a  historical  file.  The  best  estimate  for 
x    is  the  average  of  all  data,  M  ,  so  the  estimate  of  the  next  average 
becomes 

(4-5)    Mt+1  =  Mt  +  xt+1  -  Mt  . 

N 
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Since  this  is  no  longer  a  moving  average  but  an  estimate  of  an  average. 
Brown  changes  the  notation  to  S  .  If  the  estimation  is  repeated  for 
successive  observations,  the  resultant  smoothed  function  is 
(4-6)    Sfc  =  oxt  +  (1  -a)  St_1 

and  the  process  is  called  exponential  smoothing, 
b.  Exponential  Smoothing  (U) 

(U)  The  forecast  for  time  t+1  by  the  method  of  exponential 
smoothing  is  the  value  of  S  computed  in  equation  (4-6) ,  and  it  is  a 
linear  combination  of  all  previous  observations  such  that  the  weight 
given  to  an  observation  decreases  geometrically  with  the  time  since 
the  observation  was  taken;  the  discrete  equivalent  to  the  exponential 
decay  process.  The  smoothing  constant,  a  ,  (0  <  a  <  1)  determines  the 
proportion  of  weight  which  will  be  placed  on  the  latest  observation, 
and  therefore,  the  response  rate  of  the  technique.  The  response  is 
similar  to  moving  averages  with  a  considered  like  1/N.  The  technique 
achieves  accuracy  in  that  it  minimizes  the  weighted  sum  of  the  residuals. 
2.  Linear  Model  (U) 

(U)  The  indication  of  a  constant  changing  rate  in  the  level 
of  observations  can  be  incorporated  into  the  forecasts  by  the  use  of 
a  linear  model  where 

(4-7)    x  =  a  +  bt  +  e 

represents  an  observed  value  and  estimates  for  two  coefficients  are 
required  for  forecasting.  The  moving  averages  technique  is  extended 
to  provide  a  least  squares  fit  to  the  data  using  the  N  most  recent 
observations.  The  analogous  double  smoothing  procedure  can  then  be 
defined  by  a  double  smoothed  value,  S2.  ,  of  the  observations  through 
time  t  which  is  given  by 
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(4-8)    S2t  =  aSt  +  (1  -  a)  S.  ^ 
and  the  coefficients  are  estimated  by 


i 


(4-9)    at  =  2St  -  S2t 

and 

(4-10)    b.    =  a/1  -  a  (S  -  S2  )  . 

Applying  these  coefficients  to  the  model,  the  forecast  for  the  next 
time  period  is  given  by 

(4-11)    xt+1  =  2St  -  S2t  +  a/1  -  a  (Sfc  -  S2fc) . 

The  forecast  error  is  given  by 

(4-12)    E  =  xt+1  -  xt+1  . 

Double  smoothing  requires  two  elements  of  historical  data  and  retains 
the  flexibility  of  response  reate  dependent  on  the  selection  of  a 
value  for  the  smoothing  constant.  Linear  trends  in  the  level  of 
insurgent  activity  are  accounted  for  with  only  a  moderate  increase  in 
computational  effort. 

3.  Quadratic  Model  (U) 

(U)  The  use  of  higher  order  polynomials  was  demonstrated  by 
the  quadratic  model  and  the  triple  exponential  smoothing  technique 
where  an  observed  value  is  represented  as 

(4-13)    x  =  a  +  bt  +  i/2ct2  +  et 
which  incorporates  the  indication  of  a  change  in  the  rate  of  increase 
or  decrease  of  incident  activity.  Subjectively  this  might  be  described 
as  an  acceleration  of  the  insurgency.  Since  three  coefficients  must  be 
estimated  for  forecasting,  a  triple  smoothed  value,  S3  ,  of  the 
observations  is  defined  as 
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(4-14)    S3  =  aS2t  +  (1  -  a)S3  ^ 

and  the  coefficients  can  be  estimated  by 

(4-15)    a.  =  3S.  -  3S2,.  +  S3,. 
— t     t     t     t 

(4-16)    bt  =  (  a/2(l  -  a)2)  (  (6  -  5a) S  -  2(5  -  4a)S2t  + 

(4  -  3a)S3t  ) 

(4-17)    c.  =  (a2/2(l  -a)  )  (S^  -  2S2^  +  S3.  )  . 
— t  t      t     t 

Canbining  the  coefficients  into  the  model  gives  the  ofrecasting 

relationship  for  the  next  time  period  as 

(4-18)    X_-  =  (2  -  3a  +  a2)S^  -  (6  -  2a) S2^_  +  2S3^   . 
t+l   t t t 

2  -  4a  +  2a2 
Three  elements  of  historical  data  must  be  retained  but,  otherwise,  the 
technique  retains  the  basic  characteristics  of  the  lower  ordered  models. 
The  computations  are  more  extensive,  however  the  coefficients  for  the 
smoothed  values  in  the  forecasting  equation  (4-18)  are  constant  with 
any  specified  value  for  the  smoothing  constant. 
B.  APPLICATION  TO  VIST  DATA  (U) 

(C)  Smoothing  techniques  for  the  constant,  linear,  and  quadratic 
models  were  applied  to  Changwat  Nakhon  Phanom  and  five  Amphoes  in 
NaWion  Phanom.  The  areas  were  selected  based  on  the  number  of  incidents 
reported  in  the  data  base  and  the  apparent  degree  of  stability  of  the 
insurgency  in  the  area  in  order  to  attempt  a  representative  sample  for 
comparative  purposes.  The  areas  are  defined  in  Table  VIII. 
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TABLE  VIII  (CONFIDENTIAL) 

IDENTIFICATION  OF  GEOGRAPHIC  AREAS  (U) 

Area  Geographical        Number  of        Number  of 

Number  Area  Incidents    Violent  Incidents 

1  Amphoe  Ban  Phaeng  46  3 

2  Amphoe  Muang  93  7 

3  Amphoe  Kam  Cha  I  200  23 

4  Amphoe  Mukdahan  355  78 

5  Amphoe  Na  Kae  1266  287 

6  Changwat  Nakhon  Phancm  2551  512 

(C)  The  first  nine  months  (January  -  September  1967)  of  total 
incident  data  were  taken  to  establish  the  initial  values,  S. ,  S2  ,  and 
S3  for  the  smoothing  techniques.  The  initial  values  were  computed  by 
least  squares  (moving  averages)  with  N  =  3  months.  A  constant  lead 
time  of  one  month  was  used  throughout  the  computations  and  smoothing 
constants  of:  .10,  .30,  .50,  .70,  and  .90  were  applied  to  each  tech- 
nique for  each  area.  The  FORTRAN  IV  program  for  the  forecasting  is 
listed  in  the  COMPUTER  PROGRAM  section. 

1.  Results  (U) 

(C)  The  results  of  the  application  of  forecasting  techniques 

are  in  COMPUTER  OUTPUT,  PART  2.  Five  criteria  were  considered  as 

having  the  potential  for  being  useful  indicators  of  the  accuracy  of 

forecasting  over  time,  depending  on  the  projected  use  of  the  forecasts. 

These  were: 

a.  The  average  forecasting  error,  defined  by 

33 

I      (x.  -  x.) 

i=10 . 

24 
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b.  The  average  percentage  forecasting  error,  defined  by 

33 

I        (x.  -  x.)/x. 
i=10   X    1   1  .  100 


24 


where  x.  4   0. 
l 


c.  The  average  absolute  value  of  the  forecasting  error  per 
incident,  defined  by 

33 

I        |x.  -  x. I 
i=10   x    1 


33 

E    x. 
i=10   X 

d.  The  average  sum  of  squared  forecasting  errors,  defined  by 

33 

E   (x.  -  x.)2 

1=10 


24 
e.  The  value  of  the  maximum  forecast  error,  defined  by 

VC\3X    [  (X,  „  —  X.  _)  ,  (X.  ,  —  X,  ,  )  ,  .  .  •  i  (X,-  ■-  X__)  ]  . 

No  attempt  was  made  to  apply  these  criteria  to  longer  range  forecasts 
than  one  month,  nor  to  evaluate  if  the  techniques  were  accurate  in  any 
absolute  sense.  One  of  the  criteria  for  relative  accuracy  was  con- 
sidered, the  average  sum  of  squared  forecasting  errors,  and  observations 
were  made  as  shown  in  Table  IX  concerning  accuracy  and  normalized 
absolute  forecasting  error  per  incident  compared  to  variations  in  the 
value  of  the  smoothing  constant. 
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TABLE  ;IX  -  '(Gd3FIDPWr IAL).  °  .,."..' 
RELATIVE  ACCURACY  OF  FORECASTING  RESULTS  (U) 


Overall 
Best  Technique 

Constant  Model 

a  =  .3 
| Error | /incident  =  .75 

Constant  Model 

a  =  .3 
| Error | /incident  =  .86 

Constant  Model 

a  =  .3 
| Error | /incident  =  .63 

Constant  Model 

a  =  .7 
| Error | /incident  =  .54 

Linear  Model 

a  =  .1 
| Error | /incident  =  .39 

Constant  Model 

a  =  .1 
| Error | /incident  =  .24 


Effect  of 
Smoothing  Constant 

Quadratic  Model  best 
at  a  =  .1,  otherwise 
Constant  Model  best 

Constant  Model  best 
throughout 


Linear  Model  best  at 
a  =  .1,  otherwise 
Constant  Model  best 

Quadratic  Model  best 
at  a  =  .1,  otherwise 
Constant  Model  best 

Linear  Model  best  at 
a  =  .1,  otherwise 
Constant  Model  best 

Constant  Model  best 
throughout 
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V.   CONGLUSIOK'S  MP  flE^^PffliONS  .(U) 
(C)  It  was  the  purpose  of  this  thesis  to  continue  the  analysis  of 
data  collected  during  the  early  stages  of  a  developing  insurgency.  This 
was  accomplished,  and  conclusions  and  recormendations  were  presented  as 
each  phase  of  the  analysis  was  examined.  There  were  some  conclusions, 
however,  which  were  not  specific  to  the  general  survey  of  the  data  or 
an  analysis  technique.  There  were  weaknesses  in  the  data;  the  absence 
of  information  concerning  the  government  forces  and  operations  seemed 
to  be  the  greatest  obstacle  to  using  the  data  in  a  meaningful  way  to 
assist  in  planning,  conducting,  or  evaluating  counterinsurgency 
operations  —  no  matter  what  analysis  techniques  were  applied  to  the 
data.  The  seriousness  of  this  problem  seemed  apparent  here;  it  was 
impossible  to  judge,  however,  what  the  effect  of  not  having  this  data 
has  been  on  analyses  conducted  in  Thailand. 

(C)  Given  that  there  were  weaknesses  in  the  data,  there  remained 
the  fact  that  it  was,  and  may  continue  to  be  for  some  time,  the 
largest  and  most  complete  set  of  computer  retrievable  information  con- 
cerning the  activities  of  Communist  insurgents  in  the  early  states  of 
an  insurgency,  and  the  likelihood  of  United  States  military  forces  being 
involved  in  counterinsurgency  operations  in  the  future  does  not  appear 
to  be  decreasing.  This  in  itself  was  sufficient  justification  for 
analysis  and  recommendations  which  might  improve  the  data. 
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(C)  The  limitation  pn  time  proclnded  deeper , exploration  into  the 
analysis  techniques  which  were  applied  to  the  d?ta.  aild  cor  .strained 
the  number  of  techniques  which  could  be  attempted.  The  reduction  of 
the  size  of  a  data  set  without  loss  of  significant  information  is  a 
valuable  element  in  determining  the  extent  to  which  the  data  will  be 
used.  The  operational  use  of  factor  scores  as  new  incidents  are 
reported,  plus  periodic  updating  of  the  factor  solution,  would  provide 
a  near  real-time  association  between  present  and  past  insurgent 
activity  —  given  that  the  validity  of  the  factors  and  their  interpre- 
tations were  generally  accepted  within  the  using  community.  Fore- 
casting insurgent  activity  was  considered  important  and  additional 
analysis  with  respect  to  the  models  applied  and  the  possibility  of 
sorting  types  of  incidents  into  subsets  with  different  characteristics 
would  seem  to  have  the  potential  of  improving  the  accuracy  of  the 
predictions,  and  accuracy  of  results  would  insure  expanded  usage.  The 
final  conclusion,  as  to  the  value  of  the  recommendations  for  changing 
the  VIST  reporting  systems  which  were  made  in  this  thesis,  can  only  be 
reached  if  the  recommendations  are  implemented  and  the  reporting  system 
reevaluated  at  a  later  date. 
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Column 

1-2 

3-4 

5-6 

7-8 

9-16 

17-18 

19-24 
19 
20 
21 
22 
23 
24 
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26 
27 
28 
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Append  ex  a 

VIST  DATA  FORMAT 
Element  of  Information 
Changwat  Code 
Amphoe  Code 
Tambon  Code 
Muban  Code 

UTM  Coordinates  of  Incident 
Incident  Code 
Reporter  of  Incident 
Royal  Thai  Army  (RTA) 
Civilian 
Police 
Paramilitary 
Source 
Unknown 

Initiator  of  Incident 
RTA 
Police 
Paramilitary 
Communist  Terrorist 
Unknown 


»  %  \ 


43 


;UN£IASSIFIED 


4       <• 


- 


UNCLASSIFIED 


30-34 


30 
31 
32 
33 
34 


35-36 

37-38 

39-40 

41-48 

49-56 

57-64 

65 

66 

67 

68 

69 

70-72 

73-75 

76-78 

78-80 


ITarofct  ;of  Inqident    ,  ,ni 

1  °  1 1  »»'»■'  ''''-'  .ii1 
RTA 

Civilians 

Police 

Paramilitary 

Questionable 

Day  of  Month  of  Incident 

Month 

Year 

Infiltration  UIM  Coordinates 

HEM  Coordinates  From 

UTM  Coordinates  To 

Armed  or  Uniformed 

Pathet  Lao  Involved 

Tribes  Involved 

Vietnamese  Involved 

Other  Ethnic  Involved 

Number  of  Communist  Terrorists 

Number  of  Non-Communist  Terrorists 

(Blank) 

Hour  of  Day 
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UNCLASSIFIED 
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UNCLASSIFIED 


Code   Changwat 
12   Chiang  Rai 


APPEMD:^  B 
GEOGRAPHICAL  CODES 
Code      Amphoe 

03   Chiang  Khong 


04   Chiang  Kham 


05   Chiang  Saen 


07   Thoeng 


Code        Tambon 

01  Wiang 

02  Khrung 

03  Bun  Ruang 

04  Po 

05  Muang  Yai 

06  Sathan 

07  Huai  So 

01  Yuan 

02  Chedi  Dam 

03  Chiang  Raeng 

04  Nam  Waen 

05  Fai  Kwang 

06  Wiang 

07  Sop  Bong 

01  Wiang 

02  Ban  Saeng 

03  Pa  Sak 

01  Wiang  Thoeng 

02  Ngao 

03  Ngiu 

04  Chiang  Khian 

05  Ta 

06  Tap  Tao 
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UNCLASSIFIED 


Code 


Changwat 


Code 


12   Chiang  Rai 
(Cont.) 


.Amphoe 


i  > 


07   Thoeng  (Cont.) 


08   Pong 


20   Nakhon  Phancm   01   Muang 
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Tambon 
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07 

j        |  '  '        ' 

Plong 

08 

Pa  Tan 

09 

Mae  Loi 

10 

Yang  Horn 

11 

Mae  Pao 

01 

Pong 

02 

Khuan 

03 

Ngim 

04 

Chiang  Muan 

05 

Pha  Chong  Noi 

06 

Yot 

07 

Sa 

08 

Oi 

61 

Nai  Muang 

62 

Nong  Saeng 

04 

Kurukhu 

05 

Kham  Thao 

06 

Dong  Khwang 

07 

Kham  Toei 

08 

Tha  Kho 

09 

Na  Sai 

10 

Na  Rat  Khwai 

11 

Nong  Yat 

13 

Ban  Klang 

15 

At  Samat 

16 

Ban  Phung 
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Code       Changwat 

20       Nahkon  Phancm 
(Cont.) 


CrSde 


Amphpe 


Cod* 


Tcntibcn 


02       Kham  Cha  1 


03       Tha  Uthen 


04       That  Phancm 


01  Ban  Song 

02  Kham  Cha  1 

03  Nong  Sung 

04  Nong  Sung  Tai 

05  Nong  Ian 

06  Ban  Kho 

07  Ban  Lao 

01  Tha  Uthen 

02  Chai  Buri 

03  Tha  Champa 

04  Na  Khamin 

05  Ban  Kho 

06  Pathai 

07  Phancm 

08  Phon  Sawan 

09  Ram  Rat 

10  Non  Tan 

01  That  Phancm 

02  Don  Nang  Hong 

03  Tha  Lat 

04  Na  Thon 

05  Nam  Kam 

06  Fang  Daeng 

07  Phra  Klang  Thung 

08  Phon  Thong 
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Code   Changwat 


Code  Aih£hc,$e 


Code 


Ta^ihpn^ 


20   Nahkon  Phancm   04   That  Phanom 
(Cont.) 


(Cont.) 

09 

Renu 

10 

Saen  Phan 

05   Na  Kae 

01 

Na  Kae 

02 

Kok  Turn 

03 

Kan  Luang 

04 

Dong  Luang 

05 

Na  Khu 

06 

Nong  Bo 

07 

Nong  Bua 

08 

Nong  Sang 

09 

Phra  Song 

10 

Phi  Man 

11 

Phum  Kae 

06   Ban  Phaeng 

01 

Ban  Phaeng 

02 

Na  Thorn 

03 

Nong  Waeng 

07   Mukdahan 

01 

Mukdahan 

02 

Dong  Yen 

03 

Na  Kok 

04 

Na  Sok 

05 

Ban  Khok 

06 

Phung  Daet 

07 

Phon  Sai 

08 

Wan  Yai 

09 

Si  Bun  Ruang 
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Code   Changwat 
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Code 
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20   Nahkon  Phanom 

(Cont.)       07   Mukdahan  (Cont.) 


08   Don  Tan  (K.A.) 


09   Si  Songkhram 


10       Pla  Pak   (K.A.) 


10 

Bang  Sai  Yai 

11 

Kham  A  Huan 

12 

Kham  Pa  Lai 

13 

Nikham  Kham  Soi 

14 

Na  Udom 

01 

Don  Tan 

02 

Pho  Sai 

03 

Pa  Rai 

04 

Ban  Bak 

05 

Lao  Mi 

01 

Si  Songkhram 

02 

Tha  Bo  Songkhram 

03 

Na  Dua 

04 

Na  Wa 

05 

Ban  Sieo 

06 

Ban  Uang 

07 

Sam  Phong 

08 

Na  Ngua 

01 

Pla  Pak 

02 

Kut  Ta  Kai 

03 

Khok  Sawang 

04 

Na  Makhua 

05 

Maha  Chai 

06 

Nong  Hi 
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02   Thung  Chang 
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Tambon 

62 

Ko^g,  Khwai 

03 

Chaiya  Sathan 

04 

Du  Thai 

05 

Du  Phong 

06 

Thum  Tong 

07 

Tha  Nao 

08 

Na  Sao 

09 

Na  Pang 

10 

Nong  Daeng 

11 

Nam  Kaen 

12 

Bo 

13 

Pha  Sing 

14 

Fai  Kaeo 

15 

Phong 

16 

Mo  Muang 

17 

Muang  Tit 

18 

Muang  Chang 

19 

Ruang 

20 

Suak 

21 

Suat 
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Nai  Wiang 

01 

Lae 

02 

Ngop 

07 

Pon 
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Code   Changwat 
26   Nan  (Cont.) 


Code 

Amphoe 

Code 

Tambon 

03 

Na  Noi 

01 

Na  Noi 

02 

Chiang  Khong 

03 

Na  Thanung 

04 

Bo  Kaeo 

05 

Muang  Li 

06 

Sisaket 

07 

Sathan 

08 

Santha 

04 

Pua 

01 

Pua 

02 

Ngaeng 

03 

Bo  Klua  Tai 

04 

Bo  Klua  Nua 

06 

Sila  Phet 

07 

Sila  Laeng 

08 

Sathan 

09 

Uan 

05 

Tha  Wang  Pha 

01 

Rim 

02 

Tan  Chum 

03 

Pa  Kha 

04 

Yam 

05 

Pha  To 

06 

Sa 

01 

Klang  Wiang 

02 

Khung 

03 

Tan  Chum 

04 

Na  Luang 

05 

Nam  Pua 
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Code   Changwat 
26   Nan  (Cont.) 


37   Phitsanulok 


52   Loei 


02   Nakhon  Thai 


Code     Amphoe        Code       Tambon 

06  Sa  (Cont.)         06  Nam  Muap 

07  Pong  Sanuk 

08  Yam  Hua  Nam 

09  Lai  Nam 

10  Wai  Na  Lai 

11  San 

07  Chiang  Klang  (K.A.)  01  Chiang  Klang 

02  Chiang  Khan 

03  Na  Lai  Luang 

04  Bua 

01  Nakhon  Thai 

02  Chat  Trakan 

03  Nakhon  Chum 

04  Na  Bua 

05  Noen  Phoem 

06  Nong  Kathao 

07  Pa  Daeng 

01  Chiang  Khan 

02  Khao  Khaeo 

03  That 

04  Na  Sao 

05  Bu  Ham 
07  Pak  Tom 

01  Dan  Sai 

02  Kok  Sathon 

03  Na  Di 


02   Chiang  Khan 


03       Dan  Sai 
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10 
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11 

I  Pum 

04 

Tha  Li 

01 
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02 

Nong  Phu 

03 

Lat  Khang 

04 

A  Hi 

07 

Pak  Chom 

01 

Pak  Chom 

02 

Chiang  Klom 

03 

Hat  Khamphi 

01 

Muang 
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02 

Kuan  Wan 
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Khai  Bok  Wan 
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Ban  Fang 
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Ban  Dua 
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Wiang  Khuk 
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Wat  That 

10 

Sa  Khrai 
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01  Seka 
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03  Dong  Bang 

04  Tha  Kok  Daeng 

05  Pho  Mak  Khaeng 

06  Ban  Tong 

01  Tha  Bo 

02  Kong  Nang 
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06  Ban  Thon 
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03  Chomphu  Phon 

04  Nang  Kheng 
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06  Don  Ya  Nang 

07  Non  Sombun 

08  Si  Chomphu 

01  Chum  Phon 
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03  Chum  Chang 

04  So 

05  Soem 
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Code   Changwat 

Code 

Amphoe 

Code 

Tambon 

71   Ubon 

03 

Khemmarat 

01 
02 
03 
04 
05 
06 
07 
08 
09 
10 

Khemmarat 
Kham  Pom 
Chiat 
Na  Tan 
Na  Waeng 
Nong  Phu 
Phalan 
Pho  Sai 
Muang  Yai 
Samrong 

05 

Khong  Chiam 

01 
02 
03 
04 
05 
06 
07 
08 

Na  Kham 
Kaeng  Kok 
Kham  Lai 
Nam  Thaeng 
Lat  Khwai 
Warin 
Song  Yang 
Uat  Yai 

07 

Chanuman 

01 
02 
03 
04 

Chanuman 
Khok  Kong 
Kham  Khuan  Kaeo 
Nong  Kha 
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10 
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Ban  Dan 
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Code   Changwat 
71   Ubon  (Cont.) 


1   >  ••  ■>  i 

Code 

Amphoe 

11 

Buntharik 

13   Phibun  Mangsahan 


Code 


Tarribon 

01  Phon  Ngam 

02  Kho  Laen 

03  Huai  Kha 

02  Kui  Chomphu 

03  Chik  Toeng 

04  Don  Chik 

05  Tan  Sum 

06  Sai  Mun 

07  Na  Pho 

08  Non  Klang 

09  Nong  Bua  Hi 

10  Pho  Si 

11  Pho  Sai 

12  Rawe 

13  Rai  Tai 

14  Samrong 

15  Ang  Sila 

16  Fang  Kham 

71  Phibun  Mangsahan 
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01  -  Attack 


02  -  Firef ight 


03  -  Ambush 


04  -  Harassment 


05  -  Terrorism 


APPENDIX  C 
INCIDENT  CODES  AND  DEFINITIONS 

An  action  in  which  the  CT  use  both 
gunfire  and  maneuver  in  an  attempt 
to  seize  an  objective. 
An  action  in  which  the  CT  use  gunfire 
alone  in  an  attempt  to  inflict 
casualties  and/or  property  damage. 
A  surprise  attack  usually  conducted 
from  planned  positions,  against  a 
moving  or  temporarily  halted  troop, 
unit,  or  convoy. 

An  action  in  which  the  primary  objective 
of  the  CT  is  to  temporarily  disrupt  the 
activities  of  a  unit,  installation,  vil- 
lage or  activity  rather  than  to  inflict 
serious  casualties  or  damage.  Examples 
are  burning  of  crops,  sniping  at  a 
patrol,  harassing  gunfire,  booby  traps, 
and  mining  incidents  of  a  minor  nature. 
An  action  directed  against  civilians 
or  military  personnel  not  engaged  in 
military  duties  in  which  the  primary 
objective  is  to  intimidate.  This 
includes  assassinations,  kidnappings, 
mining  or  bombing  of  public  buildings. 
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06  -  Sabotage 


07  -  Armed  Propaganda 


08  -  Anti-Aircraft  Fire 


09  -  Clash 


10  -  Armed  Procurement 

19  -  Other  (Violent) 

20  -  Organization 

21  -  Meetings 

22  -  Recruitment 

23  -  Strength 


Normally  a  covert  action,  and  causing 
damage  or  destruction  to  essential 
facilities  or  property  so  that  vital 
public  services,  communications,  or 
transportation  are  put  out  of  commission. 
An  effort  by  armed  CT  groups  to  indoctri- 
nate civilians  in  order  to  produce  a 
favorable  or  neutral  attitude  toward 
the  CT,  or  to  implant  doubt  about  RTG 
concern  for  or  ability  to  protect  its 
citizens . 

Gunfire  from  the  ground  directed  at 
aircraft  in  flight. 
An  armed  encounter  between  RTG  forces 
and  the  CT,  usually  as  the  result  of 
RTG  patrol  or  sweep  activities. 
CT  procurement  of  supplies  by  means 
of  force. 

Any  type  in  the  narrative. 
Leadership,  internal  structure, 
designations . 

Meeting  of  CT  groups  for  known  or 
unknown  purposes. 
Methods,  membership  requirements, 
admission  procedures,  recruitment  per  se. 
Of  CT  groups. 
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24  -  Defections 

25  -  Sightings 

26  -  Training  Outside 

Country 

27  -  Training  in  Country 

28  -  CT  Camp/Base 

29  -  Deployment 

30  -  Finances 

31  -  Logistics 

32  -  Cannunications 

33  -  Arms  Cache 

34  -  Arms  Smuggling 

36  -  Infiltration 

37  -  Espionage 

38  -  Counterfeiting 

39  -  Propaganda 

(non-armed) 

40  -  Captured  CT  Documents 

41  -  Captured  CT  Material 

42  -  Arrests,  Capture 

43  -  Release 

44  -  Helicopter  Sighting 

45  -  Helicopter  Landing 
50  -  CT  Plans 

99  -  Other  (Nonviolent) 


Of  CT  members. 

Of  individuals  or  groups  of  CT. 

Of  personnel  to  be  used  in  Thailand. 

Doctrine,  Tactics,  Subversion. 

Location  and/or  description. 

Order  of  battle. 

Contribution,  Extortion,  Taxation. 

Supply  Methods,  Sources  Routes, 

Material,  Suppliers. 

Methods  and  routing  including  couriers, 

Stockpiles  of  weapons. 

Personnel  augmentation  from  outside 

country. 

Includes  CT  asking  about  RTG  forces. 

Leaflets,  broadcasts,  word  of  mouth, 
themes. 


Of  CT  suspects. 
Of  CT  suspects. 


Proposed  future  actions  of  a  violent 

nature. 

Include  suspect  CT. 
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APPENDIX  D 
VARIABLES  USED  IN  ANALYSIS 
Incident  Code 
RTA  Reported 
Civilian  Reported 
Police  Reported 
Paramilitary  Reported 
Source  Reported 
Unknown  Reporter 
RTA  Initiated 
Police  Initiated 
Paramilitary  Initiated 
Communist  Terrorist  Initiated 
Unknown  Initiator 
RTA  Target 
Civilian  Target 
Police  Target 
Paramilitary  Target 
Questionable  Target 
Day  of  Month 
Month 

Armed  or  Uniformed 
Pathet  Lao  Involved 
Tribes  Involved 
Vietnamese  Involved 
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24 
25 
26 
27 


Other  Ethnic  Groups  Involved 
Number  of  Communist  Terrorists 
Number  of  Non-Communist  Terrorists 
Hour  of  Day 
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APPENDIX  E 

COMPUTATIONAL  PROCEDURE,  BIMED  03M 

Step  1.  The  data  re  in  the  form  x.  . ,  (i  =  1,2,  ...,  n,  cases) , 

(j  =  1,  2,  ...,p,  variables).  The  means  are  computed  by 

n 


X.  =  E  X.  ./n 
^  i=l  ** 


the  standard  deviations 


• n Z 

S.  =    E   (X.  .  -  X.)2 


i=l 


iD    J 


n-1 


and  the  correlation  coefficients 


n 


r.  .  = 


E   (X  .  -  X.)  (X  .  -  X.) 
,   ai    l    on    t 
a=l  J    J 


13  /IT 


n 


E   (X  .  -  X.)2  E   (X  .  -  X.)2 
,   ai    l    n   an    i 

a=l  a=l    J    J 

Step  2.  The  diagonal  elements  of  the  correlation  matrix  are  adjusted 
to  unity. 

Step  3.  The  p  eigenvalues  and  p  eigenvectors  of  the  adjusted  corre- 
lation matrix,  R,  are  obtained,  solving  the  system 

RV  =  V  A,  ^V  =  I 

where  A.  is  the  j   eigenvalue ,  and  6 .  =  (vn  . ,  v0 . . ,  v  . )  is  the 

"  J  J  1]   2]'      p]' 

j       eigenvector. 

Step  4.  The  program  determines  the  number  of  factors  to  be  rotated, 

q  ,  by  the  formula 

q  =  min(k,  m) 
where  k  is  the  number  of  eigenvalues  greater  than  c  (an  input  to  the 
program,  c  =  1.0) ,  and  m  is  a  specified  number  of  factors  (input  m  =  13) 
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Step  5.  The  coefficients  of  each  factor  are  obtained  by 

i  =  1,  2,  ...,  p 
j  =  1,  2,  ...,  q 


a.  .  =  /X7  v.  . 
ID     D  ID 


and  the  p  x  q^ factor  matrix,  A  =  (a.  .)  is  printed. 

Step  6.  A  factor  check  matrix  is  computed,  given  that  p  ^  q,  by 

T 
C  =  A  -A 

where  C  is  a  p  x  q  matrix  containing  the  eigenvalues  on  the  diagonal. 

Step  7.  Orthogonal  rotations  are  performed  on  the  factor  matrix  to 

maximize 

V=  Z[pE(a?./h?  -  (E(a?./h?))2] 

.   .   IT   1     .IT   1 

D  i  i 

where  i  =  1 ,  . . . ,  p  are  variables ;  j  =  1 /  . . . ,  q  are  factors ;  and  h? 
is  the  communality  of  the  i   variable  defined  by 

h?  =  S  a? .   . 
1   j  1D 

Step  8.  The  normalized  factor  matrix  is  computed  by 


b.  .  = 
x3  / 


a.  . 
_   ID 


h? 

l 


Step  9.  A  computational  loop  is  started  through  Step  12  until  the 
convergence  test  is  passed  in  Step  9  and  the  program  transfers  to 
Step  13.  The  variance  is  computed,  factors  by 
Sj  =  [pzto^)2  -  (2b?j)2]/p2 


and  the  matrix 


V    =  Es. 
c      ±3 


C^  X   /         £*   /  ... 
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and  if  |v  -  V  -J  <  10 7  four  successive  times,  the  program  transfers  to 

Step  13. 

Step  10.  The  maximization  criterion  is  done  for  two  factors  at  a  time. 

If  x  and  y  are  column  vectors  of  normalized  coefficients  (factor 

loadings),  then,  treating  them  as  constants, 

Cos  $  -  Sin  <j> 
(x,  y)   x  [  ]     =     (X,  Y) 

Sin  <j>       Cos  <j> 

where  X  and  Y,  the  desired  normalized  loadings,  are  functions  of  <f>, 

the  angle  of  rotation.  Necessary  conditions  for  a  maximum  are: 

A  =  E  (x.  +  y.)  (x.  -  y.) 

B=2E  x.y. 
lri 

C  =  Z  [  (x.  +  y. )  (x.  -  y. )  +  2x.y.  ]  [  (x.  +  y. ) 

(x±  -  y.)  =  2x^.1 

D  =  4  E  (x.  +  y.)  (x.  -  y.)  x.  y. 

NUM  =  D  -  2  A  B/p 
DEN  =  C  -  (A  +  B)  (A  -  B)  /p 
c))1   =1/4  AKCTAN  (NUJVDEN) 
If  (J)1^  1/4  degree  the  program  goes  to  Step  12;  otherwise,  to  Step  11. 
Step  11.  If  DEN  is  positive,  the  program  computes: 
|  Cos  <j)  |  =  Cos  (J>  * 
|  Sin  <j>  |  =  Sin  <f> ' 
otherwise,  it  computes: 

|Cos  <j>|  =  (/272)  (Cos  <J>'  +  Sin  $') 
I  Sin  <J)  I  =  {SZ72)  (Cos  cj)1  +  Sin  (J)1) 
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and  if  NUM  is  positive,  the  program  computes: 

Cos  <f>  =  |  Cos  <f>  | 

Sin  (J)  =  |  Sin  4>  | 
otherwise,  it  computes: 

Cos  cj)  =   |  Cos  <$>  | 

Sin  (J)  =  -  |  Sin  <J>  | 
and  goes  to  Step  12. 

Step  12.  One  iteration  cycle  is  completed  when  all  pairwise  single- 
plane  rotations  (Steps  10  and  11)  are  made  on  the  normalized  factors; 
the  program  then  goes  to  Step  9. 
Step  13.  The  final  rotated  normalized  factors  are  unnormalized  by 

a.  .  =  b.  .h. 
13    id  l 

and  the  rotated  unnormalized  factor  matrix,  A,  is  printed.  Final  com- 

munalities  and  the  difference  from  initial  coninunalities  are  computed 

and  printed,  and  the  original  and  successive  variance  are  printed. 
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APPENDIX  F 
COMPUTATIONAL  PROCEDURE,  BIMED  X72 
Step  1.  The  data  are  in  the  form  X.  .,  (i  =  1,  2,  . ..,  n,  cases) , 
(j  =  1,  2,  . ..,  p,  variables).  The  data  are  read  casewise  and  the 
method  of  provisional  means  is  used  to  compute  cross  products 
recursively  by 

(k) 


-(k+1) 


x. 

l 


(k+1) 


x.   +  D. 
l      l 


(k) 

=  S.  .   +  k(k+l)D.D. 

ID  1] 


-  (k) 
Di  =  (xk+l,i  "  Xi   J/^W- 

The  means,  convariances ,  standard  deviations,  and  correlations  are 

given  by 

-(n) 

x.  =  x.   , 
l    l 

(n) 

s. .  =  S. .  /n, 
ID    ID 


1/2 


s.  =  (s..) 
i     li 


and 


r. .  =  s. ./s.s.  . 
ID    ID  i  D 

Step  2.  As  required  for  initial  communality  estimates,  the  diagonal 

of  the  matrix  (r. .)  is  left  unaltered. 
ID 

Step  3.  The  matrix  (r. •)  is  tri-diagonalized  by  the  method  of  House- 
holder, its  eigenvalues  obtained  by  means  of  Sturm  sequences,  and  its 
eigenvectors  found  by  the  method  of  Wilkinson.  All  of  the  eigenvalues 
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A_  ,  A-,  . . . , _  A  are  obtained,  the  number  r  of  eigenvectors  v, ,  v~, 
. . . ,  v  is  equal  to  the  min  (a ,  and  max  (i :  A.  >c) )  where  a  is  the 
specified  number  of  factors  (input  a  =  13)  and  c  is  the  specified 
minimum  eigenvalue  (input  c  =  1.0) .  The  j   column  of  the  factor 
loading  matrix  (k.  .)  is  A. v.  .  New  communality  estimates  are  given  by 

h?  =  Zk? .   . 

If  more  than  one  iteration  for  communalities  is  required,  the  diagonal 
of  (r. .)  is  replaced  by  the  new  estimates  and  this  step  is  repeated 
from  the  beginning.  Iteration  is  continued  until  a  specified  maximum 
number  of  iterations  or  until  the  maximum  change  in  the  communality 
estimates  h?  is  less  than  .001.  Initial  and  final  communality 
estimates  are  printed  together  with  the  final  factor  loadings. 
Step  4.  Kaiser  normalizations  are  applied  to  the  loadings  by 

k! .  =  k. ./h.  , 
ID    ID  i 

and  rotations  are  performed  to  minimize  the  criterion 

P«        l    P       P 
G((k..))  =     (  p  k?  k?.  -  -  (    k?)(    k?J) 

^     afr     i=l  iai*   P   i=l  ia   i=l  * 
where  1/p  specified  an  orthogonal  varimax  rotation.  Rotations  are 
made  by  pairwise  modifications  of  columns  of  (k.  .)  through  all  pairs. 

Complete  cycles  are  continued  until  the  change  in  G  relative  to  its 

-5 
initial  value  is  less  than  10  .  The  Kaiser  normalizations  are 

corrected  for  by  the  inverse  relation 

k!  .  =  k.  .h. 
ID    ID  i 

and  the  final  values  of  k. .  are  the  primary  factor  loadings. 
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APPENDIX  G 
INITIAL  DATA  FACTOR  PATTERNS  (U) 
Total  Data  Factor  Names  and  Numbers 


Variables 
No.      Name 

1  Incident  Code 

2  RTA  Reported 


11  CT  Initiated 

12  Unknown 

Initiator 


<£! 


3 

Civilian 
Reported 

4 

Police  Reported 

5 

Paramilitary 
Reported 

6 

Source  Reported 

7 

Unknown 

Reporter       L- 

8 

RTA  Initiated    VH 

9 

Police  Initiated 

.0 

Paramilitary 
Initiated 

VH 


1 

0 


I  B 

1  2 

M-  H- 
VH 


VH 


13  RTA  Target 

14  Civilian  Target   L-   VH 

15  Police  Target 


H 

cu 


0) 

o 

■H 
i-H 
O 
PL, 


L-       L- 
VH 

VH 


VH 


VH 


w 


5 
L 


L- 


a 


VH- 


VH 


CO 
Q 


s    s    s 


VH- 


! 

S 


a 


L- 


VH 


L- 


-p 

I 

(1) 


9 

I 


10       11 


VH 
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CONFIDENTIAL 


■> 


1 

o 

No. 

Name        1 

2 

16 

Paramilitary 
Target 

17 

Unknown  Target 

18 

Day 

19 

Month 

20 

Arms  or  Uniforms 

21 

Pathet  Lao 

22 

Tribesmen 

23 

Vietnamese 

24 

Other  Ethnic 
Groups 

25 

Number  of  CT 

26 

Number  of  Non-CT 

27 

Hour 

CM 


VH 


o 

•H 
H 
O 

04 


4 


VH- 

L- 

M- 


VH 


w 


S  D 

6  7 


-p 

Q 


VH- 


H 
L 


M- 


H- 


H 


a 


< 


M 


I 

id 
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VH 


L 
VH 
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Amphoe  Chiang  Khong  Factor  Names  and  Numbers 


Variables 

IS 

0 

•H 

B 

Pn 

O 
-H 
iH 

0 

04 

No. 

Name 

i 

2 

3 

4 

5 

1 

Incident  Code 

M 

H- 

L- 

2 

RTA  Reported 

VH- 

3 

Civilian 

Reported 

L 

L 

L 

L 

4 

Police  Reported 

V 

5 
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